Desktop Search - How Contextual Information Influences Search Results & Rankings
نویسندگان
چکیده
1. MOTIVATION Sophisticated web search technology usually allows us to find appropriate documents in a few seconds. Finding these documents on our desktop is surprisingly more difficult, at least if we have been storing documents for a few years or more. This is improving somewhat with the recent crop of desktop search engines, but even with these tools, searching through our (relatively small set of) personal documents with the recent beta of Google Desktop Search is inferior to searching the (rather vast set of) documents on the web with Google. The main reason for this is that one of the distinguishing features of Google sophisticated ranking using PageRank and other features is unavailable on our desktop. This position paper explores first how the contextual information inferred from the users’ actions is used to extend search beyond simple full-text search. Second, we discuss how algorithms exploiting this information can provide efficient ranking of resources on our desktop. Regarding the first aspect, we propose activity-based metadata and relationships as sources of additional information in desktop search. This information, in many cases representing contextual information, is very useful for re-finding resources we already worked with, improving the recall in desktop search. For the second aspect, we discuss how PageRank-based ranking algorithms can exploit this contextual information and show how local and global ranking measures can be integrated in such a model, achieving at the same time personalization of rankings. Using contextual information thus shows considerable promise for extending efficient information access to our desktop, extending globally available information with user-centered activity-based information, and exploiting the unique information background we have available on our desktop. We are currently implementing first prototypes in the context of the open source Beagle project which aims to provide sophisticated desktop search in Linux.
منابع مشابه
I know I stored it somewhere - Contextual Information and Ranking on our Desktop
1 Motivation Future digital libraries will be distributed, and recent research has already explored some promising approaches focusing on distributed and peer-to-peer search and retrieval architectures, connecting distributed repositories efficiently and transparently. Another aspect, which has been less explored so far, is the role of the implicit personal repositories we all have on our deskt...
متن کاملPraana: A Personalized Desktop Filesystem
Owing to the absence of rich contextual cues, filesystem search has been full of noise, imprecision and very less recall. Contextual information in the form of annotations (tags defined by users or extracted from files) and data/process provenance would improve the personal search experience of the user. A filesystem architecture to support contextual desktop search is proposed. Implementation ...
متن کاملSemantically Enhanced Searching and Ranking on the Desktop
Existing desktop search applications, trying to keep up with the rapidly increasing storage capacities of our hard disks, offer an incomplete solution for the information retrieval. In this paper we describe our desktop search prototype, which enhances conventional full-text search with semantics and ranking modules. In this prototype we extract and store activity-based metadata explicitly as R...
متن کاملBeagle++: Semantically Enhanced Searching and Ranking on the Desktop
Existing desktop search applications, trying to keep up with the rapidly increasing storage capacities of our hard disks, offer an incomplete solution for information retrieval. In this paper we describe our Beagle desktop search prototype, which enhances conventional fulltext search with semantics and ranking modules. This prototype extracts and stores activity-based metadata explicitly as RDF...
متن کاملContext-aware Search for Personal Information Management Systems
With the fast growth of disk capacity in personal computers, keyword search over personal data (a.k.a. desktop search) is becoming increasingly important. Nonetheless, desktop search has been shown to be more challenging than traditional Web search. Modern commercial Web search engines heavily rely on structural information (i.e., hyperlinks betweenWeb pages) to rank their search results. Howev...
متن کامل